In this paper we present an algorithm to compute risk-averse policies in Markov Decision Processes (MDPs) when the total cost criterion is used together with the average value at risk (AVaR) metric. Risk-averse policies are needed when large deviations from the expected behavior may have detrimental effects, an aspect that conventional MDP algorithms usually ignore. We provide conditions on the structure of the underlying MDP ensuring that approximations to the exact problem can be derived and solved efficiently. Our findings are novel inasmuch as average value at risk has not previously been considered in association with the total cost criterion. Our method is demonstrated in a rapid deployment scenario, in which a robot must reach a target location within a temporal deadline and increased speed is associated with an increased probability of failure. We show that the proposed algorithm not only produces a risk-averse policy reducing the probability of exceeding the temporal deadline, but also provides the statistical distribution of costs, thus offering a valuable analysis tool.
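As background (this is the standard Rockafellar–Uryasev formulation of AVaR, also known as conditional value at risk, not necessarily the authors' exact notation), the AVaR at confidence level \(\alpha \in (0,1)\) of a total cost \(C\) can be written as
\[
\mathrm{AVaR}_\alpha(C) \;=\; \min_{s \in \mathbb{R}} \left\{ s + \frac{1}{1-\alpha}\,\mathbb{E}\big[(C - s)^+\big] \right\},
\]
i.e., roughly the expected cost over the worst \(1-\alpha\) fraction of outcomes, which is why optimizing it yields policies that guard against large deviations rather than only the mean.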